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Abstract 

Korner and Marton established the capacity region for the 2-receiver broadcast channel with degraded message 
sets. Recent results and conjectures suggest that a straightforward extension of the Korner-Marton region to more 
OO , than 2 receivers is optimal. This paper shows that this is not the case. We establish the capacity region for a class 

of 3-receiver broadcast channels with 2 degraded message sets and show that it can be strictly larger than the 
straightforward extension of the Korner-Marton region. The key new idea is indirect decoding, whereby a receiver 
who cannot directly decode a cloud center, finds it indirectly by decoding satellite codewords. This idea is then 
used to establish new inner and outer bounds on the capacity region of the general 3-receiver broadcast channel 
with 2 and 3 degraded message sets. We show that these bounds are tight for some nontrivial cases. The results 
suggest that the capacity of the 3-receiver broadcast channel with degraded message sets is as at least as hard to 
find as the capacity of the general 2-receiver broadcast channel with common and private message. 

I. Introduction 

c/j ! A broadcast channel with degraded message sets represents a scenario where a sender wishes to 
' communicate a common message to all receivers, a first private message to a first subset of the receivers, 
^ a second private message to a second subset of the first subset and so on. Such scenario can arise, for 
j> example, in video or music broadcasting over a wireless network to nested subsets of receivers at varying 
', levels of quality. The common message represents the lowest quality version to be sent to all receivers, 
the first private message represents the additional information needed for the first subset of receivers to 
decode the second lowest quality version, and so on. What is the set of simultaneously achievable rates 
fsj for communicating such degraded message sets over the network? 

^ This question was first studied by Korner and Marton in 1977 [1]. They considered a general 2-receiver 
^ discrete-memory less broadcast channel with sender X and receivers Yi and Y2. A common message 
Mo G [1, 2"-^"] is to be sent to both receivers and a private message Mi G [1, 2"^^] is to be sent only to 
receiver Yi. They showed that the capacity region is given by the set of all rate pairs (i?o, Ri) such thatQ 

i?o<min{/(f/;Fi),/(f/;F2)}, (D 
Ri<I{X-Yi\U), 

for some p{u,x). These rates are achieved using superposition coding [2]. The common message is 
represented by the auxiliary random variable U and the private message is superimposed to generate X. 
The main contribution of [1] is proving a strong converse using the technique of images-of-a-set [3]. 

Extending the Korner-Marton result to more than 2 receivers has remained open even for the simple 
case of 3 receivers Yi,Y2, Y^ with 2 degraded message sets, where a common message Mq is to be sent 
to all receivers and a private message Mi is to be sent only to receiver Yi. The straightforward extension 
of the Korner-Marton region to this case yields the achievable rate region consisting of the set of all rate 
pairs {Ro,Ri) such that 

i?o < min{/(f/; Fi), I{U; Y2), I{U- Fg)}, (2) 
Ri<I{X-Yi\U), 

Chandra Nair was partly supported by the Direct Grant for research at the Chinese University of Hong Kong 

'The Korner-Marton characterization does not include the second term inside the min in the first inequality, I(U\ Y\). Instead it includes 
the bound Ri + R2 < Yi). It can be easily shown that the two characterizations are equivalent. 
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for some p{u,x). Is this region optimal? 

In [4], it was shown that the above region (and its natural extension to > 3 receivers) is optimal 
for a class of product discrete-memory less and Gaussian broadcast channels, where each of the receivers 
who decode only the common message is a degraded version of the unique receiver that also decodes 
the private message. In [5], it was shown that a straightforward extension of Komer-Marton region is 
optimal for the class of linear deterministic broadcast channels, where the operations are performed in 
a finite field. In addition to establishing the degraded message set capacity for this class the authors 
gave an explicit characterization of the optimal auxiliary random variables. In a recent paper Borade et 
al. [6] introduced multilevel broadcast channels, which combine aspects of degraded broadcast channels 
and broadcast channels with degraded message sets. They established an achievable rate region as well as 
a "mirror-image" outer bound for these channels. Their achievable rate region is again a straightforward 
extension of the Korner-Marton region to A; -receiver multilevel broadcast channels. In particular. Conjecture 
5 of [6] states that the capacity region for the 3-receiver multilevel broadcast channels depicted in Figure [T] 
is the set of all rate pairs (i?o, Ri) such that 

Ro<min{I{U;Y2),I{U;Y,)}, (3) 
R,<I{X;Y,\U), 

for some p(u, x). Note that this region, henceforth referred to as the BZT region, is the same as ^ because 
in the multilevel broadcast channel Y^ is a degraded version of Yi and therefore I{U ; Y3) < I{U; Yi). 



O 
Ys 



O 
Y2 

Fig. 1. Multilevel 3-receiver broadcast channels. Message Mq is to be sent to all receivers and message Mi is to be sent only to Yi. 

In this paper we show that the straightforward extension of the Komer-Marton region to more than 2 
receivers is not in general optimal. We establish the capacity region of the multilevel broadcast channels 
depicted in Figure [Das the set of rate pairs (Rq, Ri) such that 

R^<mm{I{U2■,Y2),I{U^■,Y,)}, 
Ro + Ri< mm{I{U^; Y^) + I{X; Y^\U,), /(f/2; Y^) + I{X; Y^p^)}, 

for some p{ui)p{u2\ui)p{x\u2) , and show that it can be strictly larger than the BZT region. In our coding 
scheme, the common message Mq is represented by Ui (the cloud centers), part of Mi is superimposed 
on Ui to obtain U2 (satellite codewords), and the rest of Mi is superimposed on U2 to yield X. Receivers 
Yi and Y^ find Mq by decoding Ui, whereas receiver Y2 who may not be able to directly decode Ui, 
finds Mq indirectly by decoding a list of satellite codewords. After decoding Ui, receiver Yi finds Mi by 
proceeding to decode U2 then X. 

The rest of the paper is organized as follows. In Section UIl we provide needed definitions. In Sec- 
tion Unl we establish the capacity region for the multilevel broadcast channel in Figure [T] (Theorem [T]). 
In Section |IVl we show through an example that the capacity region for the multilevel broadcast channel 
can be strictly larger than the BZT region. In Section |Vl we extend the results on the multilevel broadcast 
channel to establish inner and outer bounds on the capacity region of the general 3-receiver broadcast 
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channel with 2 degraded message sets (Propositions [5] and (6]). We show that these bounds are tight when 
Yi is less noisy than Y3 (Proposition |7]), which is a more relaxed condition than the degradedness condition 
of the multilevel broadcast channel model. We then extend the inner bound to 3 degraded message sets 
(Theorem |2l). Although Proposition [5] is a special case of Theorem |2l it is presented earlier for clarity 
of exposition. Finally, we show that the inner bound of Theorem [2] when specialized to the case of 2 
degraded message sets, where Mq is to be sent to all receivers and Mi is to be sent to Yi and Y2, reduces 
to the straightforward extension of the Komer-Marton region (Corollary [T]). We show that this inner bound 
is tight for deterministic broadcast channels (Proposition [8]) and when Yi is less noisy than Y3 and Y2 is 
less noisy than Y^ (Proposition |9l). Finally, we outline a general approach to obtaining inner bounds on 
capacity for general /c-receiver broadcast channel scenarios that uses the new idea of indirect decoding. 

II. Definitions 

Consider a discrete-memoryless 3-receiver broadcast channel consisting of an input alphabet X, output 
alphabets 3^i, 3^2 and y^, and a probability transition function p(yi,y2,y3\x). 

A (2"^", 2"^i, n) 2-degraded message set code for a 3-receiver broadcast channel consists of (i) a 
pair of messages (Mo,Mi) uniformly distributed over [1,2"^'^] x [1,2"^^], (ii) an encoder that assigns a 
codeword x"'(mo,mi), for each message pair (mo, mi) G [1,2"-^"] x [l,2"-^i], and (iii) three decoders, 
one that maps each received sequence into an estimate (moi; '^i) ^ [l? 2"^"] x [1, 2"^^], a second that 
maps each received ?/2 sequence into an estimate mo2 £ [1, 2"^°], and a third that maps each received 
sequence into an estimate rrios G [1,2"^"]. 

The probability of error is defined as 

Pi") = P{Mi ^ Ml or Mofe 7^ Mo for = 1, 2, or 3}. 

A rate tuple (Rq, Ri) is said to be achievable if there exists a sequence of (2"^", 2"^\ 2-degraded 
message set codes with Pi^^ — 0. The capacity region of the broadcast channel is the closure of the set 
of achievable rates. 

A 3-receiver multilevel broadcast channel [6] is a 3-receiver broadcast channel with 2 degraded message 
sets where p{yi, y2, yslx) = p{yi, y2\x)p{y3\yi) for every (x, yi, 1/2, 2/3) e x 3^1 x 3^2 x 3^3 (see Figure [I])- 

In addition to considering the multilevel 3-receiver broadcast channel and the general 3-receiver broad- 
cast channel with 2 degraded message sets, we shall also consider the following two scenarios: 

1) 3-receiver broadcast channel with 3 message sets, where Mq is to be sent to all receivers. Mi is to 
be sent to Yi and Y2, and Af2 is to be sent only to Yi. 

2) 3-receiver broadcast channel with 2 degraded message sets, where Mo is to be sent all receivers 
and Ml is to be sent to both Yi and Y2. 

Definitions of codes, achievability and capacity regions for these cases are straightforward extensions of 
the above definitions. Clearly, the 2 degraded message set scenarios are special cases of the 3 degraded 
message set. Nevertheless, we shall begin with the special class of multilevel broadcast channel because 
we are able to establish its capacity region. 

III. Capacity of 3-Receiver Multilevel Broadcast Channel 
The key result of this paper is given in the following theorem. 

Theorem 1: The capacity region of the 3-receiver multilevel broadcast channel in Figure [His the set 
of rate pairs (Pi, P2) such that 

Po<min{/(f/i;y3),/(f/2;1^2)}, (4) 
Po + Pi < min{/(f/i; Y^) + /(X; Fi|f/i), /(f/2; Y2) + I{X; Filf/2), }. 

for some p{ui)p{u2\ui)p(x\u2) , where the cardinalities of the auxiliary random variables satisfy ||Wi|| < 
\\X\\+4: and \\U2\\ < + 5\\X\\ + 4. 
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Remarks: 

1) It is easy to show by setting Ui = U2 = U m the above theorem that the BZT region ([3]) is contained 
in the capacity region dH). We show in the next section that the capacity region can be strictly 
larger the BZT region. 

2) It is straightforward to show that the above region is convex and therefore there is no need to use 
a time- sharing auxiliary random variable. 

The proof of Theorem [T] is given in the following subsections. 

A. Converse 

We show that the region in Theorem [T] forms an outer bound to the capacity region. The proof is quite 
similar to previous weak converse and outer bound proofs for 2-receiver broadcast channels (e.g., see [7], 
[8], [9]). Suppose we are given a sequence of codes for the multilevel broadcast channel with Pi"-* 0. 
For each code, we form the empirical distribution for Mq, Mi,X". 

Since X ^ Fi ^ forms a physically degraded broadcast channel, it follows that the code rate pair 
(i?o, Ri) must satisfy the inequalities 

R^<I{X;Y,\U^), 

for some p{ui, x) [7], where Ui, X, Yi, I3 are defined as follows. Let Uu = (Mq, Y^~^), i = 1, . . . ,n, and 
let Q be a time-sharing random variable uniformly distributed over the set {1,2, ...,n} and independent 
of X", 17, Y^. We then set Ui = (Q, Uiq) and X = Xq, Yi = Y^q, and = Y^q. Thus, we have 
shown that 

Ro<HUi;Y,), 
Ro + Ri<nUi;Ys)+IiX;Y^\Ui). 

Next, since the decoding requirements of the broadcast channel X — > (Yi,F2) makes it a broadcast 
channel with degraded message sets, the code rate pair must satisfy the inequalities 

Ro<mm{I{U2;Y,),I{U2,Y,)}, 
Ro + Ri<I{U2]Y2)+I{X-Yi\U2), 

for some p{u2, x) [8], where U2 is identified as follows. Let U2i = (Mq, l^"^, Y^ 1), z = 1, . . . , ra, then 
we set U2 = {Q,U2q). 

Combining the above two outer bounds, we see that Ui ^ U2 ^ X forms a Markov chain. Observe 
that this Markov nature of the auxiliary random variables along with the degraded nature of X — > Fi — > 
implies that /(f/2; Fi) > /(f/s; F3) > I{U^- F3). 

Thus we have shown that the code rate pair (_Ro) Ri) must satisfy the inequalities 

Po<min{/([/i;F3),/([/2;F2)}, 
Ro + Ri< min{/([/i; F3) + /(X; Fi|f/i), I{U2] Y2) + /(X; Filf/2)}, 

for some p{ui)p{u2\ui)p{x\u2) . This establishes the converse to Theorem [B 

B. Achievability 

The interesting part of the proof of Theorem [T] is achievability. Specifically, step 3 of the decoding 
procedure for Case 2 below describes a key contribution of this paper. We show how F2 can find Mq 
without directly decoding t/" or uniquely decoding t/^. 
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To show achievability of any rate pair (Rq, Ri) in region (Hj), because of its convexity, it suffices to 
show the achievability of any rate pair (_Ro) Ri) such that 

Ro = min{/(f/i; Fg), /(f/2; Y2)} - 6 
Ro + Ri= min{/(f/i; Y,) + /(X; Y^\U^), /(t/2, Y2) + /(X; Y^\U2)} - 36, 

for some Ui ^ U2 ^ X and any 5 > 0. 
Rewriting the second inequality we obtain 

R^ + Ri = I{Ui; F3) + mm{I{U2; Y^\Ui), /(f/2; Y,) - I{Uu Y^)} + /(X; Filf/2) - 35. 

Now consider the following two cases: 

Case 1: I{Ui; F3) > /(f/2; Y2): The rates reduce to 

R^ = I{U2;Y2)-6 
Ri = I{X-Yi\U2)-25. 

This pair can be achieved via a simple superposition coding scheme [2]. 
Case 2: /(f/i; Y3) < /(f/2; F2): The rates reduce to 

Ro = I{Ur-Y^)-5 

R, = /(X; Filf/2) + min{/(f/2; Yi|f/i), /(f/2, 1^2) - /(f^i; l^a)} - 25. 
Let 5i = min{/(f/2; Fi|f/i), /(f/2, ^2) - /(f^i; n)} - ^ and S2 = /(X; rilf/2) - S, then /?i = 5i + ^2. 
CoJe Generation: 

¥\x p{ui)p{u2\ui)p{x\u2) that satisfies the condition of Case 2. Generate 2"^" = 2"(^('^i'^3)^'^) sequences 
f/"(l), . . . , UiiT^^^) distributed uniformly at random over the set of e-typicafl f/" sequences, where 5^0 
as e ^ 0. For each f/i"(mo), generate 2"^^ = 2"(°^'>^i^(^2;i'i|i/i),/(c/2,y2)-/(f/i;y3)}-5) sequences f/2"(mo,l), 
f/^(mo,2), . . ., f/^(mo, 2""^^) distributed uniformly at random over the set of conditionally e-typical f/^ 
sequences. For each f/^(mo,Si) generate 2"'^2 = 2"(-^(^''^il^2)-<5) sequences X"(mo,Si, 1), X"(mo,Si,2), 
. . ., X"(mo, Si, 2"'^2^ distributed uniformly at random over the set of conditionally e-typical X" sequences. 

Encoding: 

To send the message pair (mo, mi) G [1, 2"-^"] x [1, 2"^^], the sender expresses mi by the pair (si, S2) G 
[l,2"^i] X [1,2"^2] and sends X"(mo, Si, Sa). 

Decoding and Analysis of Error Probability: 

1) Receiver Y^ declares that mg is sent if it is the unique message such that f/"(mo) and Y^ are jointly 
e-typical. It is easy to see that this can be achieved with arbitrarily small probability of error because 

Ro = I{UuY^)-5. 

2) Receiver Yi first declares that mo is sent if it is the unique message such that f/"(mo) and Y"" 
are jointly e-typical. This decoding step succeeds with arbitrarily high probability because Rq = 
/(f/i; Fs) — S < /(f/i; Fi) — S. It then declares that si is sent if it is the unique index such that 
f/^(mo, si) and are jointly e-typical. This decoding step succeeds with arbitrarily high probability 
because 5*1 < /(f/2; Yi\Ui) — 5. Finally it declares that S2 is sent if it is the unique index such that 
X"(mo,Si,S2) and F" are jointly e-typical. This decoding step succeeds with high probability 
because ^2 = /(X; Filf/2) - 6. 

3) Receiver F2 finds Mq as follows. It declares that mo G [1, 2"^"] is sent if it is the unique index such 
that f/^(mo,si) and Y2 are jointly e-typical for some si G [l,2"'^i]. Suppose (1,1) G [1,2"^"] x 

^We assume strong typicality throughout this paper [10]. 
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[1, 2'^^^] is the message pair sent, then the probability of error averaged over the choice of codebooks 
can be upper bounded as follows 

pin) < p{([/2"(l, 1), r^") are not jointly e-typical} 

+ P{(f^2'("^o, Si), Y2) are jointly e-typical for some niQ 7^ 1} 
^/ ^ 2n{R^+s^) J2 J2 P{(f^2"(^o, ^i), 1^) jointly e-typical} 

(b) 
(c) 

<5' + 2""^ 

where (a) follows by the union of events bound, (6) follows by the fact that for mo 7^ 1, ?7|'(mo, si) 
and F2" generated completely independently and thus each probability term under the sum is 
upper bounded by 2-"(^(^2'^2)-5) [iq], (c) follows because by construction Ro + Si< /([/a; F2) -25, 
5' ^ as e ^ 0. Thus with arbitrarily high probability, any jointly e-typical f/^(mo, Si) with the 
received Y2 sequence must be of the form f/;^(l,si), and receiver Y2 can correctly decodes Mq 
with arbitrarily small probability of error. Note that Y2 may or may not be able to uniquely decode 
1/2(1, 1)- However, it finds the correct common message with arbitrarily small probability of error 
even if its rate Rq > I{Ui; F2)! 

Thus all receivers can decode their intended messages with arbitrarily small probability of error and 
hence the rate pair i?o = HUv, F3) -S,Ri = /(X; Filf/a) +min{/(f/2; Yi\Ui), /(f/2, F2) - /(f/i; Y3)} - 26 
is achievable. This completes the proof of achievability of Theorem [U 

Remarks: 

1) We denote the decoding technique used in step 3 as indirect decoding, since Y2 decodes the cloud 
center Ui indirectly by decoding satellite codewords. 

2) There is no need to break up the coding scheme into two cases. The coding scheme for Case 
2 suffices. This will become clear when we prove the achievable region for the general case of 
3-receivers with 2 degraded message sets in Proposition [51 



C. Bounds on Cardinality 

Using the strengthened Caratheodory theorem by Fenchel and Eggleston [11] it can be readily shown 
that for any choice of the auxiliary random variable Ui, there exists a random variable Vi with cardinality 
bounded by + 1 such that /(t/i;F3) = /(Vi;Ks) and I{X;Yi\Ui) = I{X;Yi\Vi). Similarly for 
any choice of U2, one can obtain a random variable V2 with cardinality bounded by HA"!! + 1 such that 
Y2) = 1(12; Y2) and /(X; Filf/2) = /(X; Yi\V2). While this preserves the region, it is not clear that 
the new random variables Vi, V2 would preserve the Markov condition Vi ^ V2 — > X. To circumvent 
this problem we adapt arguments from [11] to establish the cardinality bounds stated in Theorem [U For 
completeness, we provide an outline of the argument. 

Given Ui ^ U2 ^ X ^ (Fi, Y2, Y3), we need to show the existence of random variables Vi, V2 such that 
the following conditions hold: Vi — > V2 ^ X forms a Markov chain, I{Vi; I3) = /(f/i; I3), /(V2; I2) = 
I{U2]Y2), /(X;Fi|Vi) = I{X;Yi\Vi), and I{X;Yi\V2) = I{X;Yi\U2). Further, the cardinalities of the 
new random variables must satisfy ||Vi|| < + 4, IIV2II < W^W^ + + 4. 

This argument is proved in two steps. In the first step a random variable Vi and transition probabilities 
p{u2\vi) are constructed such that the following are held constant: p{x), the marginal probability of X 
(and hence Yi,Y2,Ys), H{Yi\Ui), H{Y2\Ui), H{Y-i\Ui), H(Y2\U2,Ui), and H(Yi\U2,Ui). Using standard 
arguments [12], [11], there exists a random variable Vi and transition probabilities p{u2\vi), with cardi- 
nality of Vi bounded by 11^*11+4, such that the above equalities are achieved. In particular the marginals 
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of X,Yi,Y2,Y3, are held constant. However the distribution of U2 is not necessarily held constant and 
hence we shall denote the resulting random variable as f/g. 
We thus have random variables Vi ^ U2 ^ X such that 

J(\/i;F3) =/(f/i;1^3), 
I{V^,Y2)=I{U,;Y2), 
I{X;Yi\Vi)=I{X;Yi\Ui), (5) 
=/(f/2;ri|t/i). 

In the second step, for each Vi = vi a new random variable V2{vi) is found such that the following 
are held constant: p(x\vi), the marginal distribution of X conditioned on Vi = Vi, H(Yi\U2,Vi = vi), 
and H{Y2\U2,Vi = vi). Again standard arguments imply that there exists a random variable V2{vi) and 
transition probabilities p{x\v2{vi)), with cardinality of Vi bounded by + 1, such that the above 
equalities are achieved. This in particular implies that 

I{V2{Viy,Y2\Vi) = I{U^;Y2\Vi) = I{U2;Y2\Ui), (6) 
IiV2{V^);Y,\V,) = IiU!,;Y,\V,) = IiU2;Y,\U^). 

Now, set V2 = (Vi, ^2(^1)) and observe the following as a consequence of Equations ^ and 

I{V2; Y2) = I{Vi; Y2) + imVi); Y2\Vi) = I{Ui; Y2) + /(f/2; Falf/i) = I{U2; Y2), 
I{X- Y^\V2) = I{X- Y^\V^) - I{V2{V{)- FilVl) = /(X; Y^\U^) - /(f/2; Y^\U^) = I{X : Y^\U2). 

We thus have the required random variables Vi, V2 satisfying the cardinality bounds +4 and (HA*!! + 
4)(||A'|| + 1), respectively as desired. 

IV. Multilevel Product Broadcast Channel 

In this section we show that the BZT region can be strictly smaller than the capacity region in TheoremfT] 
Consider the product of 3-receiver broadcast channels given by the Markov relationships 

Xi Y21 Yii Ysi, 

X2 Y12 132- (V) 

In Appendix H] we derive the following simplified characterizations for the capacity and the BZT regions. 

Proposition 1: The BZT region for the above product channel reduces to the set of rate pairs (Rq, Ri) 
such that 

Ro<HVi;Y,,)+I{V2;Y,2), (8a) 
Ro<IiVv,Y2i), (8b) 
i?i < /(Xi; FnlV^i) + /(X2; Fi2|V^2), (8c) 
for some p{vi)p{v2)p{xi\vi)p{x2\v2) ■ 

Proposition 2: The capacity region for the product channel reduces to the set of rate pairs (Rq, Ri) 
such that 

Ro<HVu;Yu) + I{V,2;Y,2), (9a) 

Ro<nV2r,Y2,), (9b) 

Ro + Ri< HVu; I31) + /(V12; I32) + /(Xi; Fnll^ii) + /(X2; ^121^12), (9c) 

Ro + Ri< HV21; Y21) + /(Xi; Yu\V2i) + /(X2; Y^2\Vi2), (9d) 
for some p{vii)p{v2i\vn)p{xi\v2i)p{vi2)p{x2\vi2). 

Now we compare these two regions via the following example. 
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Example: 

Consider the multilevel product broadcast channel example in Figure [2l where: Xi = X2 = 3^12 = 
3^21 = {0,1}, and = ysi = 3^32 = {0,^,1}, Y21 = Xi, F12 = X2, the channels F21 ^ and 
Y12 Y32 are binary erasure channels (BEC) with erasure probability |, and the channel Yu — > Y31 
is given by the transition probabilities: P{Ysi = E\Yii = E} = 1, P{Ysi = E\Yu = 0} = PjFsi = 
E\Yii = 1} = 2/3, P(r3i = 0|Fii = 0} = P{F3i = = 1} = 1/3. Therefore, the channel Xi ^ Y31 
is effectively a BEC with erasure probability 5/6. 




Fig. 2. Product multilevel broadcast channel example. 



The BZT region can be simplified to the following. 

Proposition 3: The BZT region for the above example reduces to the set of rate pairs (Rq, Ri) satisfying 

ii,<,ni„{| + |,p}, 

Ri < ^ + 1 - «. (10) 

for some < p, g < 1. 

The proof of this proposition is given in Appendix IB It is quite straightforward to see that (_Ro, Ri) = 
(|, ^) lies on the boundary of this region. 

The capacity region can be simplified to the following 

Proposition 4: The capacity region for the channel in Figure [2] reduces to set of rate pairs {Rq, Ri) 
satisfying 

Ro < minj^ + 1'*^} ' 

( r s 1 — r 1 —t ^ 

Ro + Ri <min<^- + - + ^— + l-s,t+^- + l-sy, (11) 

for some 0<r<t<l,0<s<l. 

The proof of this proposition is also given in Appendix HI Note that substituting r = t yields the BZT 
region. By setting r = 0, s = 1, t = 1 it is easy to see that {Rq, Ri) = (1/2, 1/2) lies on the boundary of 
the capacity region. On the other hand, for Rq = 1/2, the maximum achievable -Ri in the BZT region is 
5/12. Thus the capacity region is strictly larger than the BZT region. 
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-+- ' BZT region 
O Capacity region 




Fig. 3. The BZT and the capacity regions for the channel in Figure (2] 



Figure [3] plots the BZT region and the capacity region for the example channel. Both regions are 
specified by two line segments. The boundary of the BZT regions consists of the line segments: (0, 3/2) 
to (0.6, 0.2) and (0.6, 0.2) to (2/3, 0). The capacity region on the other hand is formed by the pair of line 
segments: (0,3/2) to (1/2, 1/2) and (1/2, 1/2) to (2/3,0). Note that the boundaries of the two regions 
coincide on the line segment joining (0.6,0.2) to (2/3,0). 

Remarks: 

1) Consider a 3-receiver Gaussian product multilevel broadcast channel, where 

Y21 = Xi + Zi, Yii = 1^21 + ^2, ^31 = ^11 + ^3, 

1^22 = X2 + ^4, ^32 = ^22 + Z^. 

The noise power for Zi is Ni for i = l,2,...,5. We assume a total average power constraint P on 

X=(Xi,X2). 

Using Gaussian signalling it can be easily shown that the BZT region is the set of all (i?o, Ri) such 
that 



Ro<C 
Ro<C 
Ri<C 



aPi 



aPi + N1 + N2 + N3 
aPi 



+ C 



(3{P-P,) + N, + N, 



(12) 



aPi + Ni 
aPi 



+ C 



Kp - Pi) 



for some < Pi < P, < a, /5 < 1. 

Now if we use Gaussian signaling to evaluate region Q, one obtains the achievable rate region 
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consisting of the set of all {Ro,Ri) such that 



Rn<C 



aiPi + Ni + N2 + N3 J \a2{P - Pi) + + 

{ai + h)Pi 
'l-a,-h)P, + N, 



, J . (13) 



a2(P-Pi) + iV4 + AT; 



^ < , , ((i^^i^w V , /(I - a. - 6.)(p - Pi: 



(ai + 6i)Pi 



;i-ai-&i)Pi + iVi, 
for some < Pi < P, < ai, 02, &i, &2, ai + &i, 02 + &2 < 1- 

Now consider the above regions with the parameters values: P = 1, A^i = 0.4, N2 = = 0.1, A^4 = 
0.5, A^5 = 0.1. Fixing Pi = 0.5 log(0.49/0.3), we can show that the maximum achievable Rq 
in the Gaussian BZT region is 0.5 log(2. 0566...). On the other hand, using the parameter values 
61 = 0.05, 1 - ai = 0.1725, 1 - as = 0.5079, and Pi = 0.5680 for the region given by the 
pair (0.5 log(2.0603), 0.5 log(0.49/0.3)) is in the exterior of the region. Thus restricted to Gaussian 
signalling the BZT region ([8]) is strictly contained in region However, we have not been able 
to prove that Gaussian signaling is optimal for either the BZT region or the capacity region. 
2) The reader may ask why we did not consider the more general product channel 

Xi Y21 — > Yii I31, 

X2 Y12 Ys2 Y22- 

In fact we considered this more general class at first but were unable to show that the capacity 
region conditions reduce to the separated form 

Po </(l^ii; 1^31) + /(^i2; 1^32), 
Po</(V^2i; 1^21) + /(V^22; 1^22), 

Pq + Pi < HVn; Fsi) + liVu; Y,2) + /(Xi; Yn\Vn) + /(X2; Fi2|V^i2), 
Pq + Pi < /(1^2i; 1^21) + /(V^22; Y22) + /(Xi; Y^,\V2l) + /(X2; Fi2|V^i2), 

for some p(t;ii)p(t;2i|^^ii)p(2;i|^^2i)p(^^i2)p(^22|^^i2)p(a;2|^22)- 



V. Bounds on Capacity of General 3-Receiver Broadcast Channel with Degraded 

Message Sets 

In this section we extend the results in Section |In] to obtain inner and outer bounds on the capacity 
region of general 3-receiver broadcast channel with degraded message sets. We first consider the same 
2 degraded message set scenario as in Section |III] but without the condition that X ^ Fi — > F3 form a 
degraded broadcast channel. We establish inner and outer bounds for this case and show that they are 
tight when the channel X — > Fi is less noisy than the channel X — > Ya, which is a more general class 
than degraded broadcast channels [13]. We then extend our results to the case of 3 degraded message 
sets, where Mq is to be sent to all receivers. Mi is to be sent to receivers Yi and Y2 and M2 is to be 
sent only to receiver Yi. A special case of this inner bound gives an inner bound to the capacity of the 2 
degraded message set scenario where Mq is to be sent to all receivers and Mi is to be sent to receivers 
Yi and Y2 only. 
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A. Inner and Outer Bounds for 2 Degraded Message Sets 

We use superposition coding, indirect decoding, and the Marton achievability scheme for the general 
2-receiver broadcast channels [14] to establish the following inner bound. 

Proposition 5: A rate pair (Ri, R2) is achievable in a general 3-receiver broadcast channel with 2 
degraded message sets if it satisfies the following inequalities: 

Ro<mm{I{U2;Y2),I{U3;Y,)}, 
2Ro < /(f/2; Y2) + /(f/3; F3) - /(f/2; Us\U^), 
R^<mm{I{X■,Y,\U2) + IiX■,Y^\Us),IiX■,Y^\Ul)}, (14) 
Ro + Ri< min{/(X; Y,), /(t/2; Y2) + /(X; Y,\U2), /(t/3; F3) + /(X; Y,\U,)}, 
2Ro + Ri< I{U2; Y2) + I{U^; ^3) + I{X- Y^\U2^ Us) - I{U2^ Us\Ui), 
2Ro + 2R, < I{U2, Y2) + /(X; Y,\U2) + /(f/3; Y^) + /(X; Yilf/3) - I{U2^ Us\U{), 

for some p{ui,U2,Us,x) = p{ui)p{u2\ui)p{x,U3\u2) = p{ui)p{u3\ui)p{x,U2\u3) (or in other words, both 
Ui^U2^ {U3,X) and Ui ^ U3 ^ {U2,X) form Markov chains). 

Proof: The general idea of the proof is to represent Mq by f/i, superimpose two independent pieces of 
information about Mi to obtain U2 and U3, respectively, and then superimpose the remaining information 
about Ml to obtain X. Receiver Yi decodes t/i, f/2, t^s, X, receivers Y2 and Y3 find Mq via indirect 
decoding of U2 and f/3, respectively, as in Theorem [TJ We now provide an outline of the proof 

Code Generation: Let Ri = Si + S2 + S3, where the Si > 0, i = 1,2,3 and T2 > 52, T3 > 6*3. 
Fix a probability mass function of the required form, p(ui,U2,U3, x) = p{ui)p{u2\ui)p{x,U3\u2) = 

p{ui)p{u3\ui)p{x,U2\u3). 

Generate 2"^° sequences f/"(mo), nio G [1, 2"^°] distributed uniformly at random over the set of typical 
f/" sequences. For each f/"(mo) generate 2"-^^ sequences f/^(mo,t2)j ^2 £ [1,2'^'^2] distributed uniformly 
at random from the set of conditionally typical f/^ sequences, and 2""^^ sequences f/3 (mo, ^3), ^3 G [1, 2"^^] 
distributed uniformly at random over the set of conditionally typical f/3 sequences. Randomly partition 
the 2"^2 sequences f/;J'(mo, ^2) into 2"'^^ equal size bins and the 2"^=^ f/3 (mo, ^3) sequences into 2""^^ equal 
size bins. To ensure that each product bin contains a jointly typical pair (f/^(mo, ^2), f^3*(^o, ^3)) with 
arbitrarily high probability, we require that (see [15] for the proof) 

^2 + ^3 < T2 + T3 - /(f/2; f/3|f/l). (15) 

Finally for each chosen jointly typical pair (f/^(mo, ^2), f^s ("^0, ^3)) in each product bin (s2, S3), generate 
2nSi sequences of codewords X"(mo, S2, S3, si), si G [1,2""^^] distributed uniformly at random over the 
set of conditionally typical X" sequences. 

Encoding: 

To send the message pair (mo, mi), we express mi by the triple (si,S2,S3) and send the codeword 

X"(mo,S2,S3,Si). 

Decoding: 

1) Receiver Yi declares that (mo, S2, S3, si) is sent if it is the unique rate tuple such that is jointly 
typical with ((f/"(mo), f/^(mo, ^2), f/3("^o, ^3), X"(mo, S2, S3, si)), and S2 is the product bin number 
of f/^(mo, ^2) and S3 is the product bin number of f/^(mo, t3). Assuming (mo, Si, S2, S3) = (1, 1, 1, 1) 
is sent, we partition the error event into the following events. 

a) Error event corresponding to mo 7^ 1 occurs with arbitrarily small probability provided 

Ro + Si + S2 + S3< /(X; Fi). (16) 

b) Error event corresponding to mo = 1, S2 7^ 1, S3 7^ 1 occurs with arbitrarily small probability 
provided 

Si + S2 + S3<IiX;Yi\Ui). (17) 
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c) Error event corresponding to rriQ = 1, S2 = 1, -^s 7^ 1 occurs with arbitrarily small probability 
provided 

S, + Ss< I{X- Yi|f/i, f/2) = /(X; rilf/2). (18) 

The equality follows from the fact that Ui ^ U2 {Us,X) form a Markov Chain. 

d) Error event corresponding to mg = 1, S2 7^ 1, S3 = 1 occurs with arbitrarily small probability 
provided 

^1 + ^2 < /(X; Yi\Uu f/3) = /(X; Yi\U^). (19) 

The above equality uses the fact that f/i ^ f/3 ^ (f/2?X) forms a Markov chain. 

e) Error event corresponding to uiq = 1, S2 = 1, S3 = 1, si 7^ 1 occurs with arbitrarily small 
probability provided 

S, < /(X; Fi|f/i, U2, f/3) = /(X; Y,\U2, f/3). (20) 

Note that the equality here uses a weaker Markov structure Ui (f/2, f/3) — * X. 

Thus receiver Yi decodes (mo, S2, S3, si) with arbitrarily small probability of error provided equations 
dUll-dlOl) hold. 

2) Receiver Y2 decodes niQ via list decoding of f/^ (7/10,^2) (as in Theorem [T])- This can be achieved 
with arbitrarily small probability of error provided 

Ro + T2<I{U2;Y2). (21) 

3) Receiver Y3 decodes tuq via list decoding of f/^(mo,t3) (as in Theorem [T]). This can be achieved 
with arbitrarily small probability of error provided 

i?o + T3</(f/3;F3). (22) 

Combining equations (fT5])- (|22l) and using the Fourier-Motzkin procedure [16] to eliminate T2, T3, S'l, 5*2, 
and Ss, we obtain the inequalities in (fT4l) . The details are given in Appendix HIl ■ 

Remarks: 

1) The above achievability scheme can be adapted to any joint distribution p(ui,U2,U3,x). However 
by letting f72 = (f/2, f/i) and letting f/3 = (f/3, f/i) we observe that the region remains unchanged. 
Hence, without loss of generality we assume the structure of the auxiliary random variables as 
described in the proposition. It is also interesting to note that the auxiliary random variables in the 
outer bound described in the next subsection also possess the same structure. 

2) An interesting choice of the auxiliary random variables is to set f/2 or f/3 equal to Ui (i.e., only one 
of the the receivers tries to indirectly decode Mq), say let f/3 = f/i. This reduces the inequalities |5] 
(after removing the redundant ones) to: 

i?o<min{/(f/2;F2),/(f/i;>^3)}, 

i?i </(X;ri|f/i), (23) 
Ro + Ri< min{/(X; Y,), /(f/2; F2) + /(X; Y^\U2), /(f/i; F3) + HX; Y,\U,)}, 

where f/i — f/2 — X form a Markov chain. 

This region includes the capacity region of the multilevel case in Theorem [B It is easy to verify 
that for any Ui ^ U2 ^ X that form a Markov chain, the comer points of the region in Theorem [T] 
satisfy the above inequalities (and this suffices by convexity). 

We now establish the following outer bound 
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Proposition 6: Any achievable rate pair {Rq,Ri) for the general 3-receiver broadcast channel with 2 
degraded message sets must satisfy the conditions: 

Ro < min{/(f/i; Yi), /(f/2; Y2) - /(t/2; Yi\Ui), I{Us; Y^) - I{Us; Filf/i)}, 
R^<IiX;Y,\U^). 

for some p{ui,U2,U3,x) = p{ui)p{u2\ui)p{x,U3\u2) = p{ui)p{u3\ui)p{x,U2\u3), i.e., the same structure 
of the auxiliary random variables as in Lemma [51 Further one can restrict the cardinalities of t/i, f/2, t/3 
to: llWill < \\X\\ + 6, IIW2II < {\\X\\ + 1)(||A'|| + 6), and H^all < (||^|| + 1)(||'^'|| + 6). 

Proof: The proof follows largely standard arguments. The auxiliary random variables are identified 
as Uii = (Mo, 17^^), U2i = (f/ii,y2"+i)' = (f/ii, With this identification inequalities Rq < 

I{Ui;Yi) and Ri < I{X;Yi\Ui) is immediate. The other two inequalities also follow from standard 
arguments and is briefly outlined here. 

nRo <nXn + J2 ^(^0; Y,i\Y, "l^,) 

i 

<nXr, + j2 ^Mo, F3 1+1, yr'; y3^) - nyr'-, ysiiMo, y^ -+1) 

i 

nXn + ^(^0' ^3 m, yi-'; y^) - I{Y, "+1; YuW,, Y^^) 

i 

= nA„ + 5^ /(f/3.; Fs.) - /(f/3.; Yi^Uu). 

i 

where (a) uses the Csiszar sum equality. 

The cardinality bounds are established using a similar argument as in IIII-CI To create a set of new 
auxiliary random variables with the bounds of Proposition [6l we first replace U2 by {U2, Ui) and f/3 by 
{U-i,Ui). It is easy to see from the Markov chain relationships Ui ^ U2 ^ {U^,X) and f/i ^ f/3 ^ 
{U2,X) that the following region is same as the that of Proposition [6l 

Ro < min{/(f/i; Y,), I{Uu U2; Y2) + I{X; Yi\Uu U2) - /(X : Y^\Ui), 

liU,, Us; Y,) + /(X; Y,\Uu f/3) - HX : Y,\U,)}, (24) 
Ri<I{X;Yi\Ui). 

Then using standard arguments one can replace Ui by f/j* satisfying < HA"!! + 6, such that the 

distribution of X and H{Yi\Ui), H(Yi\Ui,U2), H{Yi\Ui,U^), H{Y2\Ui), H{Y2\Ui,U2), H{Yj\Ui), and 
H(Y-i\Ui, f/3) are preserved. Now for each = ui one can find f/|(Mi) with cardinality less than HA"!! + 1 
each such that the distribution of X conditioned on f/j" = ui, H(Yi\Ui = ui, U2), and H{Y2\Ui = ui, U2) 
are preserved. Similarly one can find for each = Ui, a random variable U^{ui) with cardinality less 
than + 1 each such that the distribution of X conditioned on Ui = ui, H{Yi\U* = ui,U-i), and 
HiYslU* = ui, f/3) are preserved. This yields random variables f/j*, f/|, f/3 that preserve the region in (|24l) . 
(Note that as the distribution of X conditioned on Ui = ui is preserved by both f/|(ui) and f/3 (ui), it is 
possible to get a consistent triple of random variables f/j", f/g, t/|.) Finally setting Ui = UI, U2 = (f/i , f/2) 
and f/3 = (f/*, f/3) gives the desired bounds on cardinality as well as the desired Markov relations. ■ 

Remarks: 

1) The above outer bound appears to be very different from the inner bound of Proposition [5l However, 
by taking appropriate sums of the inequalities defining the region of Proposition [6l we arrive at the 
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conditions 

i?o < min{/(f/2; Y^) - /(f/2; Y^\Ui), I{Us; Y^) - /(t/3; Filf/i)), 
i?i</(X;yi|f/i), 

Ro + Ri< min{/(X; Yl), /(t/2; 1^2) + I{X; Y^p^), /(t/3; Kj) + HX; Y,\U,)}, 
2Ro + Ri< /(t/2; Y2) + /(t/3; Fa) + /(X; Y,\U2, f/3) - /(f/2; f/slf/i) + /(t/2; f/sl^i, f/i). 

These conditions include some redundant ones, but are closer in structure to the inequalities defining 
the inner bound of Proposition [51 

2) The outer bound in Proposition [6] reduces to the capacity region for the multilevel case in TheoremfT] 
To see this observe that when X ^ Yi ^ Y^ form a Markov chain, 

Ro < I{U3; 1^3) - HUs; Y,\Ui) < liU^; F3) - /(f/3; Y,\Ui) = I{Ui; Y,). (25) 

Further from Ri < I{X; Fi|f/i), we have Ro + Ri< /(f/i; I3) + /(X; Fi|f/i). Thus the outer bound 
is contained in the achievable region of Theorem [U i.e., 

Ro < min{/(f/i; ^3), /(f/2; F2)}, (26) 
Ro + Ri< {/(f/i; Y,) + /(X; Yi|f/i), /(f/2; 1^2) + /(X; Y^p^)}. 

3) The inner and outer bounds match if Yi is less noisy than Y3 [13], that is if /(f/; F3) < /(f/; Fi) 
for all p{u)p{x\u) . As shown in [13], this condition is more general than degradedness. As such, 
it defines a larger class than multilevel broadcast channels. 

Proposition 7: The capacity region for the 3-receiver broadcast channel with 2 degraded message 
sets when Yi is a less noisy receiver than ^3 is given by the set of rate pairs (Rq, Ri) such that 

Ro < min{/(f/i; F3), /(f/2; F2)}, (27) 
Ro + Ri< min{/(f/i; Y,) + /(X; Y,\U,), /(f/2; ^2) + /(X; Y.lU^)}, 

for some p{ui)p{u2\ui)p{x\u2) ■ 

From the definition of less noisy receivers [13] we have /(f/3;F3|f/i = ui) < /(f/3;Fi|f/i = ui) 
for every choice of ui and thus /(f/3; F^jf/i) < /(f/3; Fi|f/i) for every p(ui)p{us\ui)p{x\u-i) . Using 
(|25] ) it follows that the general outer bound is contained in (|271) . The comer point of (ITTI) (under 
the less noisy assumption) is contained in the region given by ([23] ) and thus achievable by setting 
f/3 = f/i in the region of Proposition [S] 

B. Inner Bound for 3 Degraded Message Sets 

In this section we establish an inner bound to the capacity region of the broadcast channel with 3 
degraded message sets where Mq is to be sent to all three receivers. Mi is to be sent only to Yi and 
F2, and M2 is to be sent only to Yi. We then specialize the result to the case of 2 degraded message 
sets scenario, where Mq is to be sent to all receivers and Mi is to be sent to Yi and Y2 and establish 
optimality for two classes of channels. 

The inner bound we establish is closely related to that of Proposition [5l To explain the connection, 
consider a 3-receiver broadcast channel scenario where message Mq is to be sent to all three receivers, 
message M12 is to be sent to receivers Yi and Y2, message A/13 is to be sent to receivers Yi and Fj, and 
message Mn is to be sent only to receiver Fi. An inner bound to the capacity region for this scenario that 
uses superposition coding and Marton's coding scheme would be to represent Mq by an auxiliary random 
variable Ui, (Mo, Mu) by an auxiliary random variable f/2, (Mo,A/i3) by f/3, and (Mq, M12.M13, Mn) 
by X, where f/i — > f/2 — ^> (f/3,X) and f/i ^ f/3 (f/2,X) form Markov chains. 

The inner bound of Proposition [5] follows from the above scenario by relaxing the conditions that 
F2 needs to decode M12 and F3 needs to decode M13 and considering both messages as parts of the 



15 



private message to receiver Yi. However, instead of eliminating the auxiliary random variables U2 and f/3 
completely (as in the BZT region, which is a straightforward extension of the Komer-Marton scheme), we 
keep them and have receivers Y2 and 13 use the new technique of indirect decoding to find Mq through 
U2 and f/3, respectively. As we have shown in Section |IVl having these random variables U2 and f/3 can 
strictly improve the achievability region of the 2-message sets scenario. 

Now consider the 3 degraded message set scenario. We relax the condition in the above scenario that 
Y3 needs to decode M13. Recall the proof of Proposition |5l We let Ri = S2, R2 = S3 + 5*1 and represent 
Mo by Uu {Mo, Ml) by U2, {Mq, S3) by f/3, and (Mo,Mi,M2) by X. Receiver Yi finds (Mo,Mi,M2) 
by decoding f/i, f/2, f/3, X; receiver Y2 finds (Mq, Mi) by decoding f/i, f/2; and receiver Y3 finds Mq by 
indirectly decoding Ui through f/3. We obtain the following conditions for achievability of any rate tuple 
{Ro, Ri, S3, Si) by replacing S2 by Ri in conditions (fT5])- (|22l) and adding the condition T2 < /(f/2; Y2\Ui) 
(to enable Y2 to completely decode f/2). 

Ri < T2, 

S3 < T3, 

Ri + S3<T2 + T3-I{U2;U3\Ui), 
Ro + Si + Ri + S3<I{X;Yi), 

Si + S3<I{X;Yi\Ui, f/2) = I{X; Filf/2), 

Si + Ri< /(X; Yi\Ui, f/3) = /(X; Filf/3), (28) 
Si + Ri + S3<IiX;Yi\Ui), 

Si < I{X-Yi\Ui,U2,U3) = I{X;Yi\U2,U3), 
Ro + T2< I{Ui, f/2; Y2) = /(f/2; Y2), 

T2<I{U2;Y2\Ui), 
R0 + T3 < I{Ui,U3;Y3) = I{U3;Y3), 

for some p{ui,U2,U3, x) = p{ui)p{u2\ui)p{x,U3\u2) = p{ui)p{u3\ui)p{x,U2\u3). 

Performing Fourier- Motzkin procedure to eliminate the variables Si, S3, T2 and T3 yields the following 
achievable region. 

Theorem 2: A rate triple {Rq, Ri, R2) is achievable in a general 3-receiver broadcast channel with 3 
degraded message sets if it satisfies the conditions: 

Ro<I{U3;Y3) 

Ri < min{/(f/2; ^21^1), /(X; Yi\U3)}, 
R2<I{X;Yi\U2) 
Ro + Ri< min{/(f/2; Y2), /(f/2; Islf/i) + /(f/3; Y3) - /(f/2; f/slf/i)}, 
2Ro + Ri< /(f/2; Y2) + /(f/3; Y3) - /(f/2; f/3|f/i), 
Ro + R2< /(f/a; Y3) + /(X; Yi\U2, f/3) 
Ri + R2<IiX;Yi\Ui), 
Ro + Ri + R2< min{/(X; Yi), /(f/3; ^3) + /(X; rilf/3), 

IiU2; Y2\Ui) + /(f/3; Y3) + /(X; Yi\U2, f/3) - /(f/2; f/slf/i)}, (29) 
2/?o + /?i + /?2 < /(f/2; I2) + /(f/3; >3) + /(X; ri|f/2, f/3) - /(f/2; f/3|f/i), 
/?o + 2Ri + R2< /(f/2; F2|f/i) + /(f/3; F3) + /(^; Yi\U3) - /(f/2; f/3|f/i), 
2/?o + 2Ri + R2< /(f/2; 1^2) + /(f/3; 1^3) + /(X; rilf/3) - /(f/2; f/3|f/i). 

for some p{ui,U2,U3,x) = p{ui)p{u2\ui)p{x,U3\u2) = p{ui)p{u3\ui)p{x,U2\u3) (i.e., as before both Ui 
^ f/2 ^ {U3,X) and f/i ^ f/3 ^ (f/2,X) form Markov chains). 
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Remark: The region of Theorem [2] reduces to the inner bound of Proposition [5] by setting Ri = 0. The 
equivalence between the two descriptions is proved in Appendix Hill 

We now consider a 2 degraded message set scenario where Mq is to be sent to all receivers and Mi is 
to be sent to receivers Yi and Y2. The following inner bound follows from Theorem [2] by setting R2 = 0. 

Corollary 1: A rate pair {Rq, Ri) is achievable in a 3-receiver broadcast channel with 2 degraded 
message sets, where Mq is to be decoded by all three receivers and Mi is to be decoded only by Yi and 
Y2 if it satisfies the following conditions: 

Ro<I{U;Y,), 

R,<mm{I{X;Y2\U),IiX;Y^\U)}, (30) 
i?o + ^1 < min{/(X; Y^), /(X; Fi)}, 

for some p{u)p{x\u). 
Remarks: 

1) Region (|30l ) coincides with the straightforward extension of the Korner-Marton 2-receiver region. 

2) By setting R2 = 0, U2 = X, and = Ui = U the region in Theorem [2] reduces to (l30l) . Thus 
region in (l30l) is contained in region (|29l ). 

3) It may seem that the region obtained by setting = in (|29l ) is larger than region (l30l) . but they 
are in fact equal, variables, we see that Therefore, there is no need to introduce t/s. To prove this, 
observe that 

Ro + Ri< /(f/2; Y2\Ui) + /(t/3; Fa) - /(f/2; U^Wi) 

= I{Us; Ys) + /(f/3; Y2\Ui) + /(t/2; ^21^73) - /(f/3; r2|t/2) - /(f/3; t/2|t/i) 

= /(f/3; ^3) + /(f/2; 1^2|f/3) - /(f/3; U2\Y2, f/l) 

< J(f/3;r3) + /(X;r2|f/3). 
Thus the rate pairs must satisfy the following inequalities 

Ro<imY,), 

Ro + Ri< mm{I{Us; Ys) + /(X; Y2\U3), /(f/3; ^3) + /(X; Yi\Us)}, (31) 
Ro + Ri< min{/(X; F2), /(X; Fi)}. 

Clearly this is contained inside region (l30l) and hence region (|29l) reduces to the one in Corollary 
[Uwhen i?2 = 0. 

4) Inner bound (|30l) is optimal for the following two special classes of broadcast channels. 

Proposition 8: Achievable region (|30l) is tight for deterministic 3-receiver broadcast channels. 

It is straightforward to show that the set of rate pairs (_Ro, Ri) such that 

Ro<min{H{Y,),H{Y2),H{Y,)}, 
Ro + Ri<mm{H{Y2),H{Y^)}, 

for some p(x) constitutes an outer bound on the capacity region. To show achievability, we need 
only consider the three choices for U: (i) U = Y3, and (ii) U = X, and (iii) f/ = 0. 

Proposition 9: Achievable region (|30l) is optimal when Yi is a less noisy receiver than Y3 and Y2 
is a less noisy receiver than ^3. 

Note that this result generalizes Theroem 3.2 in [4] where the authors assume the receivers are Y2 
and Yi are degraded versions of Y3. To show optimality, we set Ui = (Mq, Y^^^) and thus the only 
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non-trivial inequality in the converse is Ri < ■aim{I{X;Yi\U), I{X;Y2\U)}. To see this observe 
that 

nRi <J2liMi;Yu\Mo,Y,",^,) 

i 

< HMi; Yu\Mo, + E ^(^3"'; ^i^l^O' ^iVi)) 

i i 

J2 mu nV; ^hI^o, y;-') - J2 ^(nV; ^hI^o, y^-') + J] ^3^|Mo, rr^) 

i j i 

< J]/(x,;rH|Mo,rr^), 

where (a) uses the Csiszar sum equality and (6) uses the assumption that Yi is a less noisy than F3, 
which implies that /(Fi'^i+i; IsilMo, Y^-^) < I{Y^i+i; Fii|Mo, Kj^"^). The bound Ri < I{X; Y2\U) 
can be proved similarly. 

C. Inner Bounds for k-receiver Broadcast Channels 

The inner bounds discussed in previous subsections suggest the following extension to general A; -receiver 
broadcast channel scenarios with given message requirements. To illustrate our procedure we shall use 
the running example of a 3-receiver broadcast channel with 3 messages to receiver subsets: {1}, {1, 2}, 
and {2, 3}. 

To obtain an inner bound to capacity for a given message requirement, we first consider all nonempty 
receiver subsets. Let So be the collection of subsets specified by the message requirements. For each 
A e Sd, we introduce an auxiliary random variable for every B D A. Thus in our example, Sd — 
{{1},{2,3},{1,2}}, and five auxiliary random variables are introduced corresponding to the subsets: 
{1,2,3}, {1,2}, {1,3}, {2,3}, and {1}. Let Sj denote the receiver subsets for which auxiliary random 
variables are introduced but are not in So. In the example, Sj = {{1, 3}, {1, 2, 3}}. 

The receiver subsets with auxiliary random variables assigned to them are classified into levels based on 
their cardinality with the lowest level subsets having the largest cardinality. There is a Markov structure 
between the variables as follows: if Ub represents the auxiliary random variable corresponding to the 
subset B and Ua represents the auxiliary random variable corresponding to the subset A C B, then one 
can set Ua = {Ub, Ua)- Thus an auxiliary random variable Ua corresponding to a subset A should contain 
all auxiliary random variables corresponding to the subsets B D A. For the running example. Level 1 
contains the subset {1, 2, 3}, Level 2 contains the subsets {1, 2}, {1, 3}, and {2, 3}, and Level 3 contains 
the subset {1}. The Markov relationships between these auxiliary random variables are defined by: 

Uu = (C/123, C/12), C/13 = (C/123, U,s), C/23 = (C/123, C/23), 

C/l = {[/l2,[/l3,i>i}. 

Code generation proceeds one level at a time beginning with the lowest level followed by the second 
lowest level, and so on. The codebooks corresponding to auxiliary random variables at each level are 
randomly generated conditioned on codewords at the lower level according to the Markov structure of the 
auxiliary random variables. Random binning is performed at each level to find jointly typical codewords 
to represent message products. 

Decoding is performed at receiver i as follows: let Tj represent the collection of receiver subsets that 
contains i for which auxiliary random variables are introduced. A subset A G T, is said to be minimal if 
there is no 5 e Ti such that B C A. Let T/^^'^ be the collection of minimal subsets in Tj. For the example 
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we obtain 

Ti = {{1, 2, 3}, {1, 2}, {1, 3}, {1}} and T^" = {1}, 

T2 = {{1, 2, 3}, {1, 2}, {2, 3}} and T^- = {{1, 2}, {2, 3}} , 

T3 = {1, 2, 3}, {1, 3}, {2, 3}} and Tg"- = {{1, 3}, {2, 3}} . 

By the Markov structure defined above, it is clear that all the messages for receiver i are represented by 
the auxiliary random variables W™™, which correspond to the elements of 7^"^*". The auxiliary random 
variables in W™*" n So represent private messages for receiver i, while those in W™*" n S'f^ contain only 
parts of private messages. Receiver i uses indirect decoding to find the private messages encoded into 
cloud centers by using the satellite codewords represented by the auxiliary random variables in W™*". 

In our running example, receiver 3 indirectly decodes t/23 using the pair (f/i3,?723)- That is, the rate 
constraints are such that receiver 3 may not be able to uniquely decode f/13 but is able to decode the correct 
t/23. However, receivers Y2 and Yi should be able to correctly decode (t/i2,t^23) and Ui, respectively, 
and hence these receivers impose the usual (direct) decoding constraints on the rates. In general, when 
jjram p| _ 0^ indirect decoding is not needed as in the Examples below, where as in Proposition |5] 
indirect decoding is needed. 

The following two examples show that the above procedure yields the best known inner bounds for 
special classes of broadcast channels. 

Example 1: 2-receiver broadcast channel where Mi is to be decoded by receiver Yi and M2 is to be 
decoded by Y2. We generate 3 auxiliary random variables corresponding to the three non-empty subsets 
of {1,2}: W for {1,2}, U for {1} and V for {2}. Setting U = {U,W) and V = {V,W) represents 
the Markov structure among the variables. Observe that the auxiliary random variables are exactly as in 
Marton's coding scheme and so is the code generation we outlined earlier. 

Example 2: A;-receiver broadcast channel with 2 degraded message sets, where Mq is to be decoded by 
receivers {1, . . . , /c} and Mi is to be decoded by {l,...,fc — 1}. The only subsets that we would assign 
auxiliary random variables to here are {1, . . . , /c} and {1,...,A; — 1}. We thus introduce the auxiliary 
random variable Ui for {1, . . . , A;} and U2 for {1, . . . , /c — 1}. The region is then be given by 

i?o</(f/i;n), 

Ro + Ri< /(f/2; Yi), for « = 1, . . . , A; - 1, 
Ri < /(f/2; Y,\Ui) for i = 1, . . . , - 1, 

where Ui ^ U2 ^ X ^ {Yi, . . . , Y^} form a Markov chain. Clearly in this case it is optimal to set 
U2 = X, which reduces the region to the straightforward extension of the Korner-Marton scheme. 

Remark: Our procedure can result in an explosion in the number of auxiliary random variables introduced 
even in simple scenarios. However, as we have shown in Section |IVl indirect decoding may be needed to 
achieve the capacity region for some classes of channels. Thus the introduction of such a large number 
of auxiliary random variables may indeed be necessary in general. 

VI. Conclusion 

Recent results and conjectures on the capacity of {k > 2)-receiver broadcast channels with degraded 
message sets [6], [4], [5] have lent support to the general belief that the straightforward extension of 
the Korner-Marton region for the 2-receiver case is optimal. This paper shows that this is not the case. 
We show that the capacity region of the 3-receiver broadcast channels with 2 degraded message sets can 
be strictly larger than the straightforward extension of the Korner-Marton region. The achievability proof 
uses the new idea of indirect decoding whereby a receiver decodes a cloud center indirectly through joint 
typicality with a satellite codeword. Using this idea, we devise new inner bounds to the capacity of the 
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general 3 -receiver broadcast channel with 2 and 3 degraded message sets and show optimality in some 
cases. The structure of the auxiliary random variables in the inner bounds can be naturally extended to 
more than 3 receivers. The bounds also provide some insight into how the Marton achievable rate region 
may be extended to more than 2 receivers. 

The results in this paper suggest that the capacity of the k > 2 -receiver broadcast channels with degraded 
message sets is as at least as hard to find as the capacity of the general 2-receiver broadcast channel with 
common and private message. However, it would be interesting to explore the optimality of our new inner 
bounds for classes where capacity is known for the general 2-receiver case, such as deterministic and vector 
Gaussian broadcast channels. It would also be interesting to investigate applications of indirect decoding 
to other problems, for example, 3-receiver broadcast channels with confidential message sets [11]. 
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Appendix I 
Proof of Propositions [B IH [31 and g] 

To prove Propositions [H [21 note that it is straightforward to show that each simplified characterization is 
contained in the original region as the characterizations are obtained by using the channels independently. 
So we only prove the other non-trivial direction. 

Proof of Proposition [T]- 

We prove that for the product broadcast channel given by dV]) the BZT region (O reduces to the 
expression ([8]). 

Consider the first term ([8a| ) in the BZT region 

= IiU;Y3i) + IiU;Ys2\Ysi) 
<IiU;Ys,) + IiU,Y,,;Y,2) 
</(t/;r3i) + /(f/,>^ii;V'32). 
Now set Vi = U and V2 = {U, Yn). Thus the above inequality becomes 

Ro<I{Vi;Ysi) + I{V2;Ys2). 
The second term ([8bl) in the BZT region is simply given by 

R^<I{Vi;Y2i). 

Finally, consider the last term ([8cl) 

Ri < I{X;Y\U) = I{XuX2;Yn,Yu\U) 

= HiYn, Y,2\U) - HiYn, Yu\X,, X2, U) 

= HiYu\U) + HiY,2\U, Yn) - H{Yn\Xn U) - H{Y,2\X2, U) 

= /(Xi; Yn\U) + H{Yi2\U, Y^) - H{Y,2\X2, U, Yn) 

= IiX,;Yn\V^) + IiX2;Y,2\V2). 

The fact that p{vi)p{v2)p{xi\vi)p{x2\v2) suffices follows from the structure of the mutual information 
terms. 

Proof of Proposition 

We prove that for the product broadcast channel ([7]) the capacity region given by Theorem \T\ reduces 
to the expression 

Consider the first term ([9a| ) in the capacity region 

Ro<I{Ui-Y,) = I{Ui;Y,,,Ys2) 

= I{UuYsi) + IiUi;Ys2\Ysi) 
<IiUi;Ysi) + IiUi,Yu;Ys2) 
<IiU,;Ysi) + IiU,,Yn;Ys2). 

Now set Vn = Ui and Vu = (f/i,Fii). 

The second term ([90) in the capacity region is Rq < I{U2;Y2i). Now set V21 = U2 and from Ui 
U2 {Xi,X2) we have Vn V21 — > Xi. Thus the second term can be rewritten as _Ro < -^(^21; Y21) 
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Consider the third term (|9c1 ) 

Ro + Ri<I{Uv,Y^)+I{X-Y^\Ui) 

= liUu Fsi, + /(Xi, X2; Yn, Yu\U,) 

< I{Uu F31) + /(f/i, Vli; 1^32) + ll2|f/i) - H{Yn, F12IX1, X2, t/i) 

= /(f/i; Fsi) + /(f/i, ^i; ^2) + i^(ni|f/i) 

+ H{Yi2\Ui, Yn) - H{Yn\Xi, Ui) - H{Yi2\X2, U,, Fn) 
= I{Viv, Fsi) + /(V12; >32) + /(Xi; Fnll^ii) + /(X2; ^121^^12). 
Finally consider the last term (l9dl) 

i?o + < /(f/2; 1^21) + /(X; Filf/2) 

= /(t/2; F21) + /(Xi, X2; Fn, >^i2|f/2) 

= nU2; F21) + ^^^(^-11, Y^2\U2) - H{Y,,, F12IX1, X2, f/2) 

< /(f/2; F21) + i/(rii|f/2) + H{Yi2\U2. Fii) - //(FiilXi, f/2) - i/(ri2|X2, f/2, Fii) 
= I{V2u Y21) + /(Xi; Fn|^2i) + /(X2; ri2|f/2, ^-n) 

= I{V2i\ Y21) + /(Xi; Fn|^2i) + /(X2; Y,2\U2. U,, Yn) 

< I{V2i] Y21) + /(Xi; Yn\V2i) + /(f/2, X2; Fi2|f/i, Fn) 

= /(V21; I21) + /(Xi; Fn|V^2i) + /(X2; Yi2\Vi2). 

The fact that p(fii)p(f2i)p(a;i|^2i)p(^i2)p(a^2|^i2) suffices follows from the structure of the mutual infor- 
mation terms. 

In the proof of propositions [3] and |4] we shall make use of the following simple fact about the entropy 
function [10]. 

H{ap, 1 — p, (1 — a)p) = H{p, 1 — p) + pH{a, 1 — a). 

Proof of Proposition \3j[ 

We prove that the region given by ([8]) reduces to (flOl) for the binary erasure channel described by the 
example in Section |IVl 

Let F{Vi = i} = at, P{Xi = Ol^i = i} = /i^. Then, 

\ i i / i ^ ' 

I{Vi] Y21) = H \ OijJi, X ai(l - /ii) I - X aiH{jji, I- Hi), 

\ i i / i 

I{Xn Yn\V,) = E (f ^ ' ^ "^^^^ ^ " 5Z " /^«)^ ^ 

i 

Similarly, let P{V2 = i} = A, P{X2 = 0\V2 = i} = Ui. Then 

/(^2;i3i) = i// |5^A^^i,5ZA(l-^^.)) - ^5^A//(i^.,l-'^.), 

\ j i / i 

I{X2;Y^2\V2) = Y,f3^H{u,,l - Ui). 
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Now setting Y,^j3iH{ui, 1 - z/j) = 1 - g, and J2i (^iH{fii, 1 - fii) = 1 - p, we obtain 



<i(l-(l-p)) = |. 



\ i i / i 



I{Xi-Yi,\Ui] 



< 1 - {1 - p) = p, 
1 — p 



\ j i / i 

/(X2;ri2|f/2) = i-g. 

Therefore, any rate pair in the BZT region must satisfy the conditions 

Ro < minjl + , 



^ 1 — p 

Ri < + 1 - g. 

for some < p, g < 1. 

It is easy to see that equality is achieved when the marginals of Vi are given by F{Vi = 0} = P{Vi 
1} = p/2, P{Vi = E} = 1 - p and the marginals of V2 are given by P{V2 = 0} = P{V2 = 1} 
q/2, P{V2 = E} = l-q, (see Figure g]). 



p/2 O g/2 O 





p/2 1 O 

Fig. 4. Auxiliary channels that achieve the boundary of the BZT region. 

Proof of Proposition 

We prove that the region ^ reduces to region (fTTI) for the binary erasure channel described by the 
example in Section HVl 

Assume that P{Vii = = ai,P{Xi = 0\Vn = i} = fii,P{Vi2 = i} = A,P{^2 = Ol^is = = 



uji. Further, there exist r,s,t G [0, 1] such that 



iy,,F{V2i = t}=^,,P{X, = 0\V2i = t}= ^ , , 

i 
i 

H{X,\V2i) = Y,lrH{u,, 1 - = 1 - t. 

i 

Clearly from the Markov condition Vu V21 ^ -^1, we require 1— t<l — ror equivalently r < 
We can also establish the following in a similar fashion. 



^(^11; >3i) = \Hy^ aifii, ai{l - fii)\ - ^ OiiHifii, 1 

\ i i / i 

\ i i / i 



r 

■ M < ^, 

D 



\ i i / i 

1 1 — r 



1-UJi) < t, 



i 

/(X2; Y,2\Vi2) = 5^ 1 - Z/,) = 1 - 

i 

J(Xi; Yu\V2i) = ^J2j,H{u;,, 1 - cu,) = 

i 

Thus any rate pair in the capacity region must satisfy 

Rn < min | — h -, 1 1 , 

for some 0<r<t<l,0<s<l. Note that 
Eaualitv in the above conditions is achiev( 



, r s 1 — r 1 — t 
< min < - ^ \ h 1 - s, t H h 1 - 

16 2 2 



r some us^SJSJ-,ussSi- Note that substituting r = t yields the BZT region. 
Equality in the above conditions is achieved by the choices of auxiliary random variables shown 
Figure [51 and thus the above region is the capacity region. 



00 




r/2 1 O 



Vii V21 Xi 

Fig. 5. Auxiliary channels that achieve the boundary of the capacity region. 
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Appendix II 

FOURIER-MOTZKIN ELIMINATION FOR PROPOSITION [5] 

In this section we provide the details of the Fourier-Motzkin procedure in the proof of Proposition [51 
To eliminate T2,T3 we need to consider the following set of inequalities 

^3 < ^3, 

52 + 53<T2 + T3-/(f/2;f/3|t/i), 

Ro + T2<IiU2;Y2), 
Ro + T3<IiU2;Ys). 

Elimination T2 first we end up with 

^3 < ^3, 

Ro + S2 + Ss< /(f/2; Y2) + T3 - I{U2; f/3|f/i), 
Ro + S2<IiU2;Y2), 
Ro + n<IiU2;Y,). 

Elimination of T3 in the above leads us to 

2i?o + ^2 + 53 < /(f/2; Y2) + /(f/2; Y,) - /(f/2; f/3|f/l), 
Ro + S2<IiU2;Y2), 
Ro + S3<IiU2;Y,). 

Thus any pair Rq, Ri = Si + S2 + that satisfies the following set of inequalities is achievable 

Si>0, 

S2>0, 
S3>0, 

Ro + S2<IiU2;Y2), 
Ro + S,<I{Us;Y,), 

2Ro + S2 + Ss< /(f/2; Y2) + /(f/3; F3) - /(f/2; f/3|f/i), 
Ro + S2 + S3 + Si<IiX;Yi), 

S2 + Ss + Si<I{X;Yi\Ui), 
S2 + Si<I{X;Yi\Us), 
Ss + Si<I{X;Yi\U2), 

5i</(X;Fi|f/2,f/3). 
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Substituting for Si = Ri — S2 — S3 yields 

S2>0, 
S3>0, 

S2 + S3 < Ri, 
Ro + S2<IiU2;Y2), 
Ro + S3<IiW,Y3), 
2Ro + S2 + S3< I{U2; Y2) + /(t/3; F3) - I{U2; U-^Wi), 
Ro + Ri<IiX;Yi), 

Ri<IiX;Yi\Ui), 

Ri<Ss + IiX;Yi\U3), 

Ri<S2 + I{X-Yi\U2), 

Ri<S2 + S3 + I{X-Yi\U2.U3). 

Elimination of 5*2 leads to 

0<53 

Ro + S3<I{U3-Y3), 
Ro + Ri<I{X-Yi), 

Ri<IiX;Yi\Ui), 

Ri<Ss + IiX;Yi\U;), 

S3 < Ri, 
Ro<HU2;Y2), 
2Ro + Ss< /(f/2; Y2) + nU;; Y,) - /(f/2; Us\Ui), 

S3<nX;Yi\U2), 
Ro + Ri<nU2;Y2) + IiX;Yi\U2), 
2Ro + Ri + S3< /(t/2; Y2) + /(t/3; 1^3) - /(f/2; UslUi) + I{X; rilf/2), 
< /(X; Yi\U2, U3) redundant, 
Ro + Ri< I{U2; Y2) + S3 + I{X; Yi\U2, f/3), 
2Ro + i?i < /(f/2; Y2) + /(f/3; F3) + Fi|f/2, f/3) - /(f/2; f/slf/i). 

Finally eliminating S3 (and removing redundant inequalities) leads one to the region in Proposition [5l 

Appendix III 
Proof of Remark 1 following Theorem [2] 

Consider the 3-receiver broadcast channel with 3 degraded message sets. Let R2 = Si + S2 + S^. The 
proof is in three steps: 
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(z) First, we show that any rate tuple {Rq, Ri, Si, S2, S3) is achievable provided 



Si + Ss<I{X;Yi\U2), 
Si + R, + S2<I{X;Y^\U,), 

Si + S2<IiX;Yi\U3,U2), 
Si<IiX;Yi\U2,U3), 
Ro + T2i+T22<I{U2;Y2), 

T2i<IiU2;Y2\U,), 
Ro + n<IiUs;Y,), 



for p{ui,U2,U2,U3,x) = p{Ui)p{u2\ui)p{u2\u2)p{x,U3\u2) = p{ui)p{u3\ui)p{u2, U2\u3)p{x\u2, U3) , 

i.e. Ui ^ U3 ^ {U2, U2,X) and f/i ^ f/2 — > f/2 ^ {U3,X) form Markov chains. 
{a) Then, we show that the region defined by (|32|) is equal to the inner bound in Theorem [21 
[iii] Finally we show that when Ri = 0, the conditions (|32|) reduce to conditions (fT5l)- (|22l) in the proof 
of Proposition m thus completing the proof of Remark 1. 

A. Achievability of Rates Satisfying (l32l) 

First we outline the achievability of any rate tuple {Rq, Ri, Si, S2, S3) that satisfies conditions (|32l) . 
Code generation is very similar to that in the proof of Proposition [51 We insert U2, an auxiliary random 
variable representing the information about Mi, between Ui and f/2; so for every f/"(mo) we generate 
2"T2i lJ2{mo,mi) sequences and randomly partition them into 2"'^^ bins. For each f/^(mo,mi), we 
generate 2"^22 f/^(mo, mi, ^21) sequences and randomly partition them into 2"'^2 bins. We then generate 
2nT3 u^i^Tpri^^ j^g) sequences and partition them into 2"'^^ bins. For each product bin ((mi, S2), S3) we select a 
jointly typical pair (t/^(mo, mi, ^2), U3{mo, t^)). Finally for product bin ((mi, S2), S3) with corresponding 
jointly typical ([/^(mo, mi, ^2), f^3^(mo, ts)) pair, we generate 2"'^i sequences X"(mo, mi, S2, S3, si). 

To ensure correct code generation (existence of relevant jointly typical sequences) we require that 



Ri < 
S2 < 

S3< 



T21, 
^22; 

T2i+T3-/(f/2;f/3|?7i), 

T21+T22+T3-/(t/2;f/3|?7i), 

IiX;Yi), 

I{X;Yi\Ui), 

IiX;Yi\U2), 



Ri + S3< 
Ri + S2 + Ss< 
-Ro + 'S'l + -Ri + 5*2 + 5*3 < 

^1 + i?l + ^2 + ^3 < 
Sl + S2 + Ss< 



(32) 




Ri + S3<T2i+T3-I{U2;U3\Ui), 
Ri + S2 + Ss< T21 + T22 + T3 - IiU2; Us\Ui). 
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Receiver Yi uses joint typicality to find (mo, mi, Si, S2, S3). The following conditions on the probability 
of error ensure successful decoding (the corresponding events that partition the error event are listed). 



Rq + Si 


+ Ri 


+ S2 


+ s. 


< /(^; 


Yi 


), 




(event: rho ^ 1 


) 


Si 


+ Ri 


+ S2 


+ S3 




Yi 






(event: 


(mo = 


l,mi ^ 1,S3^ 1)) 




Si 


+ S2 


+ s. 


< i{x- 


Yi 


K 




(event: 


{rho = 


l,mi = l,S2 7^(l,l),S3^1)) 






Si 


+ Ss 


< i{x- 


Yi 


U2] 




(event: 


(mo = 


l,mi = 1,S2 = (1,1), S3 ^ 1)) 




Si 


+ Ri 


+ S2 


< i{x- 


Yi 


Us] 




(event: 


(mo = 


1,S3 = l,mi 7^ 1)) 






Si 


+ S2 


< i{x- 


Yi 


U3, 


U2), 


(event: 


{rrio = 


1,S3 = l,mi = 1,S2 7^ (1,1))) 








Si 


< I{X; 


Yi 


f/2. 


U3), 


(event: 


(mo = 


1, S3 = 1, mi = 1, S2 = (1, 1), si 7^ 1)) 



Receiver Y2 decodes mo via indirect decoding using U2 and mi by decoding U2 conditioned on Ui. This 
is successful provided 

i?0 + T2i+T22</(t/2;F2), 

T21 < IiU2;Y2\Ui). 

Receiver Y3 decodes mo via indirect decoding using U3. This step succeeds provided 

Ro + Ts<IiUs;Y3). 

Combining the above conditions we see that any rate tuple satisfying (l32l) is achievable. 

B. Equivalence of Conditions (|32l) to Theorem |2] 

In one direction, setting U2 = U2, S2 = 0, T22 = and T21 = T2, we obtain (l28l) . Thus conditions (|32|) 
contain the region described by Theorem [21 

For the reverse direction we break down the argument into two cases. 

Case 1: T22< IiU2;Y2\U2) 

Observe that Y2 can also decode S2 and setting Ri = R1 + S2, R2 = R2 — S2, and T2i+r22 = T2 we see 
that conditions (El) along with T22 < /(t/2; Y2\U2) imply conditions dlH]). Thus under T22 < /(t/2; >2|f/2), 
the region described by (l32l) is contained in the region described by Theorem [21 

Case 2: T22 < I{U2\Y2\U2) 

If T22 > /(f/2; Y2\U2), then the condition R0 + T21+T22 < /(f/2; Y2) implies that R0 + T21 < /(f/2; Y2) 
and ^2 8 requirement for successful decoding can be changed to 

Ro + T2i<I{U2;Y2), 
T21 </(f/2;r2|f/i). 

In the rest of the inequalities, replacing f/2 by f72 only weakens them and hence it is optimal to set 
f/2 = f/2- These new inequalities imply ([281) in which we replace Si by Si + 6*2 and f/2 by f/2. Thus under 
T22 > /(f/2; ^2|f/2) also, the region described by ([321 ) is contained in the region described by Theorem[2l 
Combining Cases 1 and 2 we see that rate pairs satisfying conditions ((32l) is contained in the region 
described by Theorem [2l This completes the proof of their equivalence. 
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C. Reduction to Proposition \5\ 

If Ri = 0, the region described by conditions (|32|) reduce to 

< T21, 

'S'2 < 722; 

53 < T3, 

53<r2i + T3-/(f/2;[/3|f/i), 

^2 + ^3 < T21 + T22 + T3 - /([/2; t/3|t/l), 



Si + S2 + Ss< I{X; 


Yi 


U2), 


Ro + Si + S2 + S3< I{X- 


Yi 


), 


^1 + S2 + S3 < /(X; 


Yi 


Ui), 


Si + S3< I{X- 


Yi 


U2), 


Si + S2< i{x- 


Yi 


U3), 


Si + S2< i{x- 


Yi 


f/3,^2) 


Si < I{X; 


Yi 


U2,Us) 


Ro + T21 + T22 < I{U2 


Y2 


), 


T21 < I(U2 


Y2 


\Ui) 


Ro + T3< /(t/3 


Y, 


)■ 



Recalling that R2 = S2 + S3 + Si, and setting T22 = T21 + T22, observe that any (i?o, 5*1, S'2, S3) satisfying 
the above inequalities (|33l) also satisfies 

5*2 < T22, 

S3 < T3, 

S2 + S3<r22+T3-/(f/2;f/3|?7i), 

Ro + Si + S2 + S;<I{X-Yi), 

Si + S2 + S3</(X;ri|f/2), 

Si + S3</(X;Fi|f/2), 

Si + S2</(X;ri|f/3), 
Si + S2</(X;ri|[/3,f/2), 

Si</(X;ri|[/2,f/3), 
i?o + T22</(f/2;r2), 
Ro + T^<I{Uz\Y;). 

These conditions are clearly maximized by setting U2 = Ui which in turn reduces the equations to 
conditions (fT5l) -(l22l) of [51. Thus the region defined by (|33l) is contained in the region given by Proposition 
m The other direction is direct as the region in Proposition [5] is obtained by setting U2 = Ui in (|33l) . This 
completes the proof of Remark 1 . 

Remark: Observe that we do not need the auxiliary random variable U2 to characterize the region in 
either the 3 degraded message sets case (Theorem [J]) or the 2 degraded message sets case (Proposition [5]). 
This is in accordance with the structure of auxiliary random variables as prescribed by the remark in the 
introduction of Subsection |V-B[ 



